pet scan
CoCoLIT: ControlNet-Conditioned Latent Image Translation for MRI to Amyloid PET Synthesis
Sargood, Alec, Puglisi, Lemuel, Cole, James H., Oxtoby, Neil P., Ravì, Daniele, Alexander, Daniel C.
Synthesizing amyloid PET scans from the more widely available and accessible structural MRI modality offers a promising, cost-effective approach for large-scale Alzheimer's Disease (AD) screening. This is motivated by evidence that, while MRI does not directly detect amyloid pathology, it may nonetheless encode information correlated with amyloid deposition that can be uncovered through advanced modeling. However, the high dimensionality and structural complexity of 3D neuroimaging data pose significant challenges for existing MRI-to-PET translation methods. Modeling the cross-modality relationship in a lower-dimensional latent space can simplify the learning task and enable more effective translation. As such, we present CoCoLIT (ControlNet-Conditioned Latent Image Translation), a diffusion-based latent generative framework that incorporates three main innovations: (1) a novel Weighted Image Space Loss (WISL) that improves latent representation learning and synthesis quality; (2) a theoretical and empirical analysis of Latent Average Stabilization (LAS), an existing technique used in similar generative models to enhance inference consistency; and (3) the introduction of ControlNet-based conditioning for MRI-to-PET translation. We evaluate CoCoLIT's performance on publicly available datasets and find that our model significantly outperforms state-of-the-art methods on both image-based and amyloid-related metrics. Notably, in amyloid-positivity classification, CoCoLIT outperforms the second-best method with improvements of +10.5% on the internal dataset and +23.7% on the external dataset.
Efficient Parameter Adaptation for Multi-Modal Medical Image Segmentation and Prognosis
Saeed, Numan, Hardan, Shahad, Ridzuan, Muhammad, Saadi, Nada, Nandakumar, Karthik, Yaqub, Mohammad
Cancer detection and prognosis relies heavily on medical imaging, particularly CT and PET scans. Deep Neural Networks (DNNs) have shown promise in tumor segmentation by fusing information from these modalities. However, a critical bottleneck exists: the dependency on CT-PET data concurrently for training and inference, posing a challenge due to the limited availability of PET scans. Hence, there is a clear need for a flexible and efficient framework that can be trained with the widely available CT scans and can be still adapted for PET scans when they become available. In this work, we propose a parameter-efficient multi-modal adaptation (PEMMA) framework for lightweight upgrading of a transformer-based segmentation model trained only on CT scans such that it can be efficiently adapted for use with PET scans when they become available. This framework is further extended to perform prognosis task maintaining the same efficient cross-modal fine-tuning approach. The proposed approach is tested with two well-known segementation backbones, namely UNETR and Swin UNETR. Our approach offers two main advantages. Firstly, we leverage the inherent modularity of the transformer architecture and perform low-rank adaptation (LoRA) as well as decomposed low-rank adaptation (DoRA) of the attention weights to achieve parameter-efficient adaptation. Secondly, by minimizing cross-modal entanglement, PEMMA allows updates using only one modality without causing catastrophic forgetting in the other. Our method achieves comparable performance to early fusion, but with only 8% of the trainable parameters, and demonstrates a significant +28% Dice score improvement on PET scans when trained with a single modality. Furthermore, in prognosis, our method improves the concordance index by +10% when adapting a CT-pretrained model to include PET scans, and by +23% when adapting for both PET and EHR data.
AutoPETIII: The Tracer Frontier. What Frontier?
Mesbah, Zacharia, Mottay, Léo, Modzelewski, Romain, Decazes, Pierre, Hapdey, Sébastien, Ruan, Su, Thureau, Sébastien
For the last three years, the AutoPET competition gathered the medical imaging community around a hot topic: lesion segmentation on Positron Emitting Tomography (PET) scans. Each year a different aspect of the problem is presented; in 2024 the multiplicity of existing and used tracers was at the core of the challenge. Specifically, this year's edition aims to develop a fully automatic algorithm capable of performing lesion segmentation on a PET/CT scan, without knowing the tracer, which can either be a FDG or PSMA-based tracer. In this paper we describe how we used the nnUNetv2[1] framework to train two sets of 6 fold ensembles of models to perform fully automatic PET/CT lesion segmentation as well as a MIP-CNN to choose which set of models to use for segmentation.
PEMMA: Parameter-Efficient Multi-Modal Adaptation for Medical Image Segmentation
Saadi, Nada, Saeed, Numan, Yaqub, Mohammad, Nandakumar, Karthik
Imaging modalities such as Computed Tomography (CT) and Positron Emission Tomography (PET) are key in cancer detection, inspiring Deep Neural Networks (DNN) models that merge these scans for tumor segmentation. When both CT and PET scans are available, it is common to combine them as two channels of the input to the segmentation model. However, this method requires both scan types during training and inference, posing a challenge due to the limited availability of PET scans, thereby sometimes limiting the process to CT scans only. Hence, there is a need to develop a flexible DNN architecture that can be trained/updated using only CT scans but can effectively utilize PET scans when they become available. In this work, we propose a parameter-efficient multi-modal adaptation (PEMMA) framework for lightweight upgrading of a transformer-based segmentation model trained only on CT scans to also incorporate PET scans. The benefits of the proposed approach are two-fold. Firstly, we leverage the inherent modularity of the transformer architecture and perform low-rank adaptation (LoRA) of the attention weights to achieve parameter-efficient adaptation. Secondly, since the PEMMA framework attempts to minimize cross modal entanglement, it is possible to subsequently update the combined model using only one modality, without causing catastrophic forgetting of the other modality. Our proposed method achieves comparable results with the performance of early fusion techniques with just 8% of the trainable parameters, especially with a remarkable +28% improvement on the average dice score on PET scans when trained on a single modality.
The Impact of ChatGPT and LLMs on Medical Imaging Stakeholders: Perspectives and Use Cases
Yang, Jiancheng, Li, Hongwei Bran, Wei, Donglai
This study investigates the transformative potential of Large Language Models (LLMs), such as OpenAI ChatGPT, in medical imaging. With the aid of public data, these models, which possess remarkable language understanding and generation capabilities, are augmenting the interpretive skills of radiologists, enhancing patient-physician communication, and streamlining clinical workflows. The paper introduces an analytic framework for presenting the complex interactions between LLMs and the broader ecosystem of medical imaging stakeholders, including businesses, insurance entities, governments, research institutions, and hospitals (nicknamed BIGR-H). Through detailed analyses, illustrative use cases, and discussions on the broader implications and future directions, this perspective seeks to raise discussion in strategic planning and decision-making in the era of AI-enabled healthcare.
Machine learning is helping us understand Alzheimer's disease
A revolutionary Cornell-led study uses machine learning to understand the advancement of Alzheimer's disease in people who are either cognitively normal or experiencing mild cognitive impairment. The machine learning modelling showed that predicting the future decline into dementia for individuals with mild cognitive impairment is easier and more accurate than for cognitively normal or asymptomatic individuals. Researchers also found that the predictions for cognitively normal subjects are less accurate for longer time horizons, but the opposite is true for individuals with mild cognitive impairment. Cornell researchers also found that magnetic resonance imaging (MRI) is a useful prognostic tool for people in both stages. In contrast, tools that track molecular biomarkers, such as positron emission tomography (PET) scans, are more useful for people experiencing mild cognitive impairment.
MLC at HECKTOR 2022: The Effect and Importance of Training Data when Analyzing Cases of Head and Neck Tumors using Machine Learning
Thambawita, Vajira, Storås, Andrea M., Hicks, Steven A., Halvorsen, Pål, Riegler, Michael A.
Head and neck cancers are the fifth most common cancer worldwide, and recently, analysis of Positron Emission Tomography (PET) and Computed Tomography (CT) images has been proposed to identify patients with a prognosis. Even though the results look promising, more research is needed to further validate and improve the results. This paper presents the work done by team MLC for the 2022 version of the HECKTOR grand challenge held at MICCAI 2022. For Task 1, the automatic segmentation task, our approach was, in contrast to earlier solutions using 3D segmentation, to keep it as simple as possible using a 2D model, analyzing every slice as a standalone image. In addition, we were interested in understanding how different modalities influence the results. We proposed two approaches; one using only the CT scans to make predictions and another using a combination of the CT and PET scans. For Task 2, the prediction of recurrence-free survival, we first proposed two approaches, one where we only use patient data and one where we combined the patient data with segmentations from the image model. For the prediction of the first two approaches, we used Random Forest. In our third approach, we combined patient data and image data using XGBoost. Low kidney function might worsen cancer prognosis. In this approach, we therefore estimated the kidney function of the patients and included it as a feature. Overall, we conclude that our simple methods were not able to compete with the highest-ranking submissions, but we still obtained reasonably good scores. We also got interesting insights into how the combination of different modalities can influence the segmentation and predictions.
Machine learning gives nuanced view of Alzheimer's stages
A Cornell University-led collaboration has used machine learning to pinpoint the most accurate means and timelines for anticipating the advancement of Alzheimer's disease in people who are either cognitively normal or experiencing mild cognitive impairment. The modeling showed that predicting the future decline into dementia for individuals with mild cognitive impairment is easier and more accurate than it is for cognitively normal (or asymptomatic) individuals. At the same time, the researchers found that the predictions for cognitively normal subjects is less accurate for longer time horizons, but for individuals with mild cognitive impairment, the opposite is true. The modeling also demonstrated that magnetic resonance imaging (MRI) is a useful prognostic tool for people in both stages, whereas tools that track molecular biomarkers, such as positron emission tomography (PET) scans, are more useful for people experiencing mild cognitive impairment. The team's paper, "Machine Learning Based Multi-Modal Prediction of Future Decline Toward Alzheimer's Disease: An Empirical Study," published Nov. 16 in PLOS ONE.
Machine Learning and Alzheimer's disease: What You Need to Know
As one of the most popular research subjects at this year's Alzheimer's Association International Conference (AAIC), machine learning is at the forefront of Alzheimer's disease and related dementia (ADRD) innovation. While a seemingly complex topic, machine learning can be thought of as math class for computers. Like students, computers are trained to use different methods, or formulas, to solve a problem. There are many types of methods for training, and machine learning is used to solve a wide variety of problems. Most commonly, computers are trained using data sets of patient clinical history, demographic information, and/or brain imaging.
Deep learning score predicts PD-L1 status among patients with non-small cell lung cancer
A deep learning score accurately predicted PD-L1 expression among a cohort of patients with non-small cell lung cancer who underwent PET/CT scans, according to study findings published in Journal for ImmunoTherapy of Cancer. "This study is important, as it is the single largest multi-institutional radiomic study population of [patients with NSCLC] to date treated with immunotherapy who had PET/CT scans that were used to predict PD-L1 status and subsequent treatment response," Robert J. Gillies, PhD, chair of cancer physiology and vice chair of radiology research at Moffitt Cancer Center, said in a press release. "Because images are routinely obtained and are not subject to sampling bias per se, we propose that the individualized risk assessment information provided by these analyses may be useful as a future clinical decision support tool pending larger prospective trials." Gillies and colleagues developed a deep learning score to predict PD-L1 expression, durable clinical benefit, PFS and OS among 697 patients with NSCLC treated with immune checkpoint inhibitors across three institutions. According to study results, the score enabled researchers to distinguish between patients with PD-L1-positive and PD-L1-negative status.